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We construct the networks of collaboration between partners for projects carried out with the 
support of European Commission Framework Programs FP5 and FP6. We analyze in detail these 
networks, not only in terms of total number of projects, but also for the different tools employed, the 
different geographical partitions, and the different thematic areas. For all cases we find a scale free 
behavior, as expected for such social networks, and also reported in the literature. In comparing 
FP5 to FP6, we show that despite a decrease in the number of signed contracts, and the total 
number of unique partners, there is an increase in the average number of collaborative partners per 
institution. Furthermore, we establish a measure for the central role (hub) for each country, by 
using the Minimum Spanning Tree (MST) , which we construct in detail for each thematic area (e.g. 
Informatics, Nanoscience, Life Sciences, etc.). The importance of these network hubs is highlighted, 
as this information can be used by policy planners in designing future research plans regarding the 
distribution of available funds. 

PACS numbers: 89.65.-s Social and economic systems; 89.75.-k Complex systems; 89.90.-|-n Other topics in 
areas of applied and interdisciplinary physics 



I. INTRODUCTION 

Complex network theory is a fast emerging field 
in physics research. Tools of complex network analysis 
were implemented for the analysis of a variety of sys- 
tems in many fields, spanning a very wide spectrum of 
applications^ Examples are the Internet J1,Q, the World 
Wide Web ^] , communication networks food webs @ , 
sexual contact networks [7|, scientific collaboration net- 
works Q, economic networks [1, 113, [O, d, [3 ' ^^'^ many 
more. These tools use a novel approach to deal with 
nature and society and they can lead beyond usual sta- 
tistical analysis, uncovering properties not palpable by 
classical means. 

One important network is formed by the collaboration 
of scientific and industrial institutions in specific projects 
of a finite duration. This typically involves a group of uni- 
versities, research centers, and private companies which 
join their efforts in order to solve a particular problem. 
Typically, such projects originate by the initiative of the 
scientists involved, who respond to the specific calls for 
proposals by the funding agencies. It would thus be of 
interest to investigate the patterns by which scientists 
connect among themselves, which in turn would provide 
valuable information to the government and to the pol- 
icy makers when looking to formulate their strategy for 
crucial everyday problems in environment, energy, com- 
munications, etc. 

The European Commission (EC) has established the 
European Research Area (ERA), which has been given 
the responsibility to guide Europe to the highest quality 
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research that can be performed in the world today. To 
achieve its goal the ERA sponsors the so called Frame- 
work Programs (FP), which are large funding programs 
for a certain length of time, with a fixed budget, and 
which are expected to produce tangible results. In or- 
der to ensure the diffusion of knowledge and funding 
resources with the FPs, international collaborations are 
strongly encouraged. There has been a total of seven 
FPs up to now. The latest FP is the 7'^ Framework 
Program (FP7) which runs in the period 2007-2013 with 
a total budget of ~ €51 billion. This program is com- 
prised of different funding schemes with varying scopes, 
targets, and different number of participants. Among 
such schemes are: the STREP scheme which focuses on 
a specific target problem in a consortium of about 8-10 
partners, the Integrated Projects (IPs) which are larger 
collaborations of about 20 - 25 partners, dealing with 
problems in a broader area but with a unifying theme, 
the Marie-Curie Actions (MCA) that are focused on the 
exchange of graduate students and postdoctoral scholars, 
usually between 2 partners, and several more schemes. 

In this work we analyze the collaboration network for 
the previous two FPs which have been now concluded, 
the FP5 and the FP6, and were carried out in the peri- 
ods 1996-2001 and 2002-2006, respectively. We use the 
entire dataset of all projects that were approved, funded, 
and carried out in the FPs. The information for such 
data can be obtained from CORDIS [l^. To give an 
overview of the size of the projects, we simply mention 
that in FP5 there were 16558 contracts that were carried 
out by 84267 partners (but only 27219 unique partners) 
from 147 countries, while in FP6 there were 8861 con- 
tracts that were carried out by 69237 partners (but only 
19984 unique partners) from 154 countries. In a recent 
paper ^ Almendaral el al. studied some properties of 
the FP5 network. They found that the network is scale- 
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FIG. 1: (a) Probability distribution of the degrees for the FP5 
and FP6 collaboration networks, (b) Probability distribution 
of the degrees of the collaboration networks for different in- 
struments of FP6. 
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FIG. 2: Average number of collaborations (w)^ of each node i 
versus the degree ki of the node. With red circles we represent 
the individual partners (19984 institutions), with blue circles 
the cities (6955), with green circles the provinces (1304), and 
with orange circles we represent the countries (154). 



free with an accelerated growth, and due to the hierar- 
chical modularity property, it has a self similar structure. 
They also found that the network features assortative 
mixing, which means that coUaborations among partic- 
ipants of similar size appear easier, and it possesses the 
property of small world. In the present work we start 
by doing a comparative analysis between the FP5 and 
FP6 networks, but eventually we will use tools of graph 
theory [l^ to analyze in more detail the FP6 network. 



II. ANALYSIS OF THE COLLABORATION 
NETWORK 

We construct the collaboration network by considering 
each participant in the FP as a node, and by representing 
the collaborations between partners as links between the 
nodes. Furthermore, we consider as link weight, Wij, of 
the connection between nodes i and j the total number 
of collaboration projects between these two partners. In 
this view we can construct the network using different 
detail levels, depending on the information we would like 
to extract. This "zooming" in or out, is achieved by con- 
sidering as nodes either the countries, provinces, cities, 
or the individual institutions participating in the FP. 

We first construct the network of collaboration be- 
tween individual partners (institutions), and then we cal- 
culate the degree of each node. The degree ki of node i 
is defined as the number of links that are connected to 
this specific node i. In fig. [IJa) is shown the probability 
distribution of the node degrees, P(fc), which is used to 
calculate the probability of finding a node with degree fc, 
for both the FP5 and FP6 collaboration networks. We 
identify two different regions in the plot, as separated by 
a vertical dot line around the value of fc ~ 8. In the 
region to the right of the dotted line, both probability 
distributions have a clear power law tail, P{k) ~ k~'^ . 
For the FP5 network the exponent 7 = 1.85 ± 0.03, a 
value in good agreement with [isj , the maximum degree 
is k„iax — 2784, and the mean degree of the network 
is < A; >= 26.1. For the FP6 network the exponent 
7 = 1.96 ± 0.03. The maximum degree of the network 
for FP6 data is k„iax — 2842, and the mean degree of the 
network is < fc >= 43.6. Such linear decay in the log-log 
plot is typical for many networks [l|,H,H,l3,S[l,[3,[8[. In 
the left region, the probability distribution for the FP5 
is always larger than the corresponding one for the FP6. 
This means that there are more institutions in the FP5 
having a smaller number of connections than in FP6. 
The opposite behavior is found in the right region, where 
the probability distribution for the FP6 is always larger 
than the one for the FP5, which means that the number 
of participants with a larger number of connections has 
been increased. The large increase in the value of < fc > 
reveals that the number of collaborative partners per in- 
stitution has been increased. This means that there are 
more links connecting the individual partners, and these 
links result in an enhancement of the collaboration ac- 
tivity between institutions over time, even though the 
number of signed contracts was decreased by a factor of 
2, and the total number of unique partners also consider- 
ably decreased. Similar behavior, i.e. the increase of the 
mean degree, is found when we consider the collaboration 
network based on the different funding schemes, as we 
see in fig. [Ijb) . We also observe here a scale- free behav- 
ior as with the total number of projects. Therefore, we 
find that the different funding schemes lead to the same 
collaboration behavior, even though the requirements to 
establish collaborations are not entirely the same for each 
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scheme. 

Next, we study the dependence of the mean node 
weight, (w)^ = i Wij , on the node degree k for all 
detail levels that we have available (fig. [2]) . When we con- 
sider the network of individual participants (red dots) the 
mean node weight (w) is almost uniform with a value in 
the interval between (w) — 1 and {w) = 2, for all nodes. 
Therefore, there is no traceable dependence of this value 
to the node degree fc^. We assume that this behavior is 
due to the fact that, among the large number of total 
projects the most frequently appearing case is the case 
of only one or two collaborative projects between the in- 
dividual institutions, and thus, the mean node weight is 
almost equal to the node degree found here. But this 
picture changes as we zoom out to cities, provinces, and 
eventually to country level. For the case of countries, as 
it is shown in fig. [2] with orange circles, there is a ten- 
dency of nodes with high degree to have higher mean 
node weight (w) values. For example, we find that Ger- 
many, one of the leading countries in research in Europe, 
has (w) — 1053 and (k) = 142. This happens because 
there are only a few countries which are very far from 
the average in terms of the projects carried out by them, 
as compared to the rest of the countries. Such countries 
(see later in fig.[3|), are the hubs of the network and play a 
more important role in its structure. The data for cities 
and provinces (blue and green circles in fig. [2|) behave 
similarly, and are located in the intermediate range of 
the plot. 

To understand better how the aforementioned behav- 
ior occurs, we discuss the example network of fig. [3l[a), 
where a pictorial representation of the collaboration net- 
work in different levels is shown. In this network, with 
black edges we represent possible connections among in- 
stitutions (small black circles) and with blue edges the 
connections between countries (large blue circles). The 
number of connections between all institutions belonging 
to two different countries is the weight of the connec- 
tion between these two countries. We see that while for 
the institutions the mean node weight is equal to the 
node degree, for countries their value is different. In this 
figure all three countries have k = 2 but for country I 
{w)j — 2.5, for country II {w)jj = 3.5, and for country 
III {w)jjj — 3. From this example we see that coun- 
try II plays more important role in the network, since 
it has the largest average weight, even though the other 
two countries have larger number of institutions. This 
happens because one of the institutions of country II is 
connected to many different institutions of other coun- 
tries, and thus, it increases the connection strength of 
the country that it belongs to. 

In the following analysis we focus on the collaboration 
network among countries participating in the FP6, as 
shown in fig. [3Kb). An edge connecting two nodes, i and 
j, of this network represents the presence of at least one 
collaboration project between institutions from country 
i with institutions in country j. The weight Wij of such 
an edge represents the total number of collaborations be- 




FIG. 3: (a) Pictorial representation of how the network of 
collaborations among countries is created. In this example 
with black edges we represent possible connections among in- 
stitutions (small circles) and with blue edges the connections 
between countries (large circles). The number of connections 
between institutions belonging to two different countries is 
the weight of the connection between them, (b) The actual 
network of collaborations among the 154 countries participat- 
ing to at least one FP6 project, (c) The Minimum Spanning 
Tree (MST) of the collaboration network of the 154 countries 
participating at least to one FP6 project. The green nodes 
represent the EU15 member countries, the blue nodes repre- 
sent the 10 new member countries of EU, and the red nodes 
represent all the other countries. 



tween institutions in these two countries. We transform 
this weight to a distance measure dij — 1 /wij , in such a 
way that the smaller the distance, the stronger the col- 
laboration between countries. By default dij is defined in 
the interval (0, 1], and it takes its maximum value when 
there is only one collaboration project between a pair of 
countries, Wy = 1. From this network we extract the 
Minimum Spanning Tree (MST). In order to construct 
the MST we use the following algorithm. First, we start 
with the disconnected network, which includes only the 
nodes with no links between them. Next, we sort all 
distances from highest to lowest. Next, we start adding 
links to the disconnected network in increasing distance 
order, i.e. we connect firstly the nodes with the smallest 
distances. If a loop is formed, this is not allowed, and the 
link is discarded. This procedure stops when all nodes 
are connected, and no further links can be added. The 
MST is a special graph because it reduces the number of 
links of the network, since as a tree it cannot contain any 
loops, while it keeps all the nodes connected with a to- 
tal minimum distance. This structure is a much simpler 
graph than the full network, but it still gives interesting 
information about the system. The usage of spanning 
trees as subnetworks that retain the most meaningful 
connections of the original network is an approach that 
enhanced our understanding in various complex systems. 
For example, in correlation based networks of financial 
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FIG. 4: (a) Minimum Spanning Tree (MST) indicative of the 
collaboration activity in Europe, using all thematic areas of 
the FP6 for all EU25 countries. MSTs indicative of the collab- 
oration activities, for all EU25 countries, in the following the- 
matic areas : (b) "Nanotechnologies and Nanosciences" . (c) 
"Research and Innovation" . (d) "Aeronautics and Space" . (e) 
"Food Quality and Safety" . Colour code; with green nodes we 
represent the EU15 countries and with red lines we represent 
the strongest links between them, while with yellow nodes we 
represent the 10 new member countries of EU, and with blue 
lines we represent the strongest links of these 10 countries. 



markets the Minimum Spanning Tree technique [9j has 
led to the identification of clusters of stocks that result 
to a meaningful taxonomy. 

The MST of the FP6 collaboration network, interest- 
ingly enough, has a star-like structure around some spe- 
cific countries, as it is shown in fig. [31 These countries, 
that act as hubs (strongly connected nodes) are Germany, 
United Kingdom, France, and Italy. The most connected 



hub, and the one with the most central role is Germany, 
that is located at the center of the MST in fig.[3I^c). This 
central role becomes clearer if we examine the stronger 
connections only among countries belonging to the Euro- 
pean Union. From this examination we find that almost 
all 15 European countries are directly linked to Germany, 
as it is shown in more detail in fig. (Ha), except Ireland, 
that is linked to the UK. When we consider the 10 new 
countries that joined EU in 2004 we see that 6 of them es- 
tablished stronger connections with Germany, but 4 with 
the UK and Italy. 

Similar behavior is found if we calculate the MST of 
the collaboration network between countries for every 
thematic area separately, as it is shown in figs. IHb) - 
IDJe), where we show the MST that corresponds to col- 
laboration activity among European countries, for four 
(4) different thematic areas. Namely, the thematic areas 
presented in figs. HJb) - SKe) are: the "Nanotechnologies 
and Nanosciences" , the "Research and Innovation" , the 
"Aeronautics and Space", and the "Food Quality and 
Safety". In part (b) of fig. [4] we see that Germany (DE) 
is the central hub, in (c) it is Italy (IT), in (d) it is France 
(FR), and in (e) it is the United Kingdom (UK). The ob- 
servation that different countries act as hubs in different 
thematic areas is apparent and very interesting. 

The European Commission collaboration projects fo- 
cus on a total of 16 different thematic areas. Due to 
lack of space, we show here the map with the MST only 
for four of them, and for clarity we focused only on the 
connections between the 25 EU member countries. The 
results for all 16 thematic areas, by taking into account 
links between all world countries participating in FP6 
projects, are given in table [H From this Table we can 
more easily locate the most central European countries 
(EU25) in terms of collaborations, by using the node de- 
gree of the MST. Following this methodology we find 
that Germany (DE) is the central hub for 62.5% of the 
thematic areas, the United Kingdom (UK) 25%, France 
(FR) 6.25%, and Italy (IT) 6.25%. 

We note here that by performing the same analysis to 
the FP5 collaboration network, we find the same quali- 
tative results, i.e. Germany is again the most connected 
hub, but the MST of the FP5 has only 45% common 
links with the MST of FP6. This is an indication that 
the collaboration network is not static, but it is of dy- 
namic nature, and changing with time. 



III. DISCUSSION 

In this work we studied the evolution of scientific col- 
laborations arising from the support of the European 
Commission, by analyzing the networks formed by uni- 
versities, research centers, and private companies, which 
have joined their forces to address a particular scientific 
problem, under a funding scheme provided by the Euro- 
pean Framework Programs. We did a comparative anal- 
ysis between the FP5 and FP6, and we found that they 
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TABLE I: An indication of the "centrality" of each European country (EU25) for every thematic area, as it is calculated from 
the Minimum Spanning Tree (MST) connectivity. The thematic activity codes have the following meaning AO: All Thematic 
Areas of FP6, Al: Citizens and governance in a knowledge-based society, A2: Support for the coordination of activities, A3: 
Euratom, A4: Food quality and safety, A5: Horizontal research activities involving Small- Medium Enterprises (SMEs), A6: 
Research infrastructures, A7: Specific measures in support of international cooperation, A8: Information society technologies, 
A9: Life sciences, genomics and biotechnology for health, AlO: Human resources and mobility. All: Nanotechnologies and 
nanosciences, knowledge-based multifunctional materials and new production processes and devices, A12: Policy support and 
anticipating scientific and technological needs, A13: Research and innovation, A14: Science and society, A15: Aeronautics and 
space, A16: Sustainable development, global change and ecosystems. 
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are similar in their overall structure. We found that, 
even though the number of signed contracts, and the to- 
tal number of unique partners from FP5 to FP6 has been 
decreased, the collaboration among institutions has in- 
creased, as it is revealed by a large increase in the mean 
degree of the network. 

We identified different dependence patterns of the av- 
erage node weight {w)^ = -^"^jWij on the node degree 
k, as we "zoom out" from the level of participants, to 
the level of cities, provinces, and eventually to countries. 
This behavior showed that in terms of international col- 
laborations, the more active the individual institutions of 
a country are, the more central role this country plays in 
the network structure. This centrality is demonstrated 
by using the MST of the collaboration networks. The 
MST analysis was able to distinguish the different roles 
of the hubs of the network for the different thematic ar- 



eas of the FP6. The four larger countries, i.e. Germany, 
Prance, Italy and the UK, dominate by their participance 
in all FPs. The MST analysis showed that Germany 
is the principal overall hub when all fields of research 
are combined together. These findings can be taken into 
consideration by policy makers and government agencies 
when deciding future funding schemes. 
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